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PRODUCTION OF PEPTIDES USING 
RECOMBINANT FUSION PROTEIN CONSTRUCTS 

Background of the Invention 

5 In vitro DNA manipulation and the attendant 

transfer of genetic information have developed into a 
technology that allows the efficient expression of 
endogenous and foreign proteins in microbial' hosts . 
Recombinant DNA techniques have made possible the 

10 selection, amplification and manipulation of expression 
of the proteins and peptides. 

Although expression of any foreign protein in any 
microbial host is theoretically possible, the stability 
of the protein produced often limits such practice and 

15 results in a low yield. In particular, small foreign 
proteins and oligopeptides are not easily overproduced 
in most cellular hosts. Expression of a small peptide 
in a host cell raises the possibility that the host will 
assimilate the polypeptide. 

20 In response to this problem, small peptides have 

been expressed as fusion proteins with a second larger 
peptide, such as a peptide marker (e.g., beta- 
galactosidase or chloramphenicol acetyl transferase) . 
While the resulting fusion protein may not be readily 

25 assimilated,- isolation and purification of the desired 
protein is often not very efficient or effective. 

The development of methods which permit the 
isolation and/or purification of recombinant peptides 
without requiring the use of any column chromatography 

30 separations, e.g., chromatography on an affinity column, 
would be particularly desireable. Such columns are 
typically expensive and may have a relatively limited 
lifetime. Accordingly, there is a continuing need for 
flexible, convenient methods for the isolation and 

3 5 purification of recombinant peptides. 

Summary of the Invention 

The present invention provides a method for the 
production of a recombinant peptide which employs a 

4 0 fusion protein construct including a carbonic anhydrase 



i 
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and a variable fused polypeptide. Typically, the 
carbonic anhydrase is a mammalian carbonic anhydrase 
(mCA) . The carbonic anhydrase and the variable fused 
polypeptide may be linked together by a cleavage site. 
5 The cleavage site is an amino acid or sequence of amino 
acids which results in the protein construct being 
selectively cleaved on treatment by a cleavage agent. 
The cleavage agent may be a chemical reagent, e.g. 
cyanogen bromide, which recognizes a certain chemical 
10 cleavage site such as a methionine residue. The 

cleavage agent may be also be an endopeptidase which 
cleaves the construct at a specific point in relation to 
a particular amino acid or sequence of amino acids ("an 
enzymatic cleavage site") . 
15 One embodiment of the invention is directed to a 

method of producing a peptide j which includes 
precipitating the fusion protein construct. The 
precipitated construct may be resolubilized. After 
cleavage, the fragment containing the carbonic anhydrase 
2 0 ("carbonic anhydrase fragment") may be removed by 
chromatography, filtration or, preferably, by 
precipitation. 

A second embodiment provides a method of producing 
a peptide which includes (i) cleaving the fusion protein 
25 construct to produce a soluble carbonic anhydrase 
fragment and a soluble variable fused polypeptide 
fragment, and (ii) precipitating the carbonic anhydrase 
fragment. The carbonic anhydrase fragment includes the 
carbonic anhydrase and may also include other amino 
30 acids residues, e.g., from an interconnecting peptide or 
a chemical moiety created by the cleavage reaction. The 
variable fused polypeptide fragment includes at least 
one copy of a target peptide and may also include 
additional amino acid residues such as an N-terminal or 
35 C-terminal tail sequence or an intraconnecting peptide. 
Where the variable fused polypeptide fragment includes 
more than one copy of a target peptide, the target 



WO 96/17942 



PCIYUS95/15800 



3 

peptides are typically linked by an intraconnecting 
peptide which includes a cleavage site. 

A third embodiment of the invention provides a 
method of producing a peptide which includes expressing 
5 the fusion protein construct as part of an inclusion 
body in a host cell, e.g., an E. coli host cell, and 
isolating the fusion protein construct. The fusion 
protein construct typically includes at least 2 copies 
of a target peptide. However, if the fusion protein 
10 construct includes target peptide corresponding to a 

peptide selected from the group consisting of GRF(l-44) 
(SEQ ID NO:20), GLPl(7-34) (SEQ ID NO:21) and PTH(l-34) 
(SEQ ID NO; 22), a construct having only a single copy of 
the target peptide may be expressed as a part of an 
15 inclusion body. GRF(1-41) (SEQ ID NO:23) , GLPM7-34) 
(SEQ ID NO:21) and PTH(l-34) (SEQ ID NO:22) represent 
peptides having the designated amino acids corresponding 
to growth hormone releasing factor, glucagon- like 
peptide 1 and parathyroid hormone respectively. The 
20 sequences for amino acids 1-44 of growth hormone 

releasing factor (GRFd-44) (SEQ ID NO:20)) and amino 
acids 1-36 of Glucagon-like Peptide 1 (GLPKl-36) (SEQ 
ID NO:24)) are disclosed in International Application 
No. PCT/US94/08125, the disclosure of which is 
25 incorporated herein by reference. The sequence for 

amino acids 1-84 of human parathyroid hormone (PTHd-84) 
(SEQ ID NO:25)) is disclosed in Hendy et al . , 
Proc . Nat 1 , Acad , Sci . , USA , 78, 7365 (1981) and T. Kimura 
et al., BBRC , 114 , 493 (1983), the disclosure of which 
30 is incorporated herein by reference. 

Another embodiment of the invention provides a 
method of producing a peptide which includes cleaving 
the fusion protein construct to produce a carbonic 
anhydrase fragment and a variable fused polypeptide 
3 5 fragment. The carbonic anhydrase fragment and the 

variable fused polypeptide fragment are precipitated. 
The precipitated fragments may be extacted with with a 
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solvent which includes an organic component to recover 
the variable fused polypeptide fragment. 

The invention also provides an inclusion body 
containing the fusion protein construct expressed in a 
5 host cell, e.g., an E. coli host cell. The fusion 
protein construct includes a carbonic anhydrase and a 
target peptide. The target peptide preferably includes 
at least one copy of an amino acid sequence 
corresponding to a peptide selected from the group 

10 consisting of GRF(1-41) (SEQ ID NO:23) , GLPK7-34) (SEQ 
ID N0:21) and PTHU-34) (SEQ ID NO:22) . 

Finally, the invention also provides a fusion 
protein construct which includes the carbonic anhydrase 
and a target peptide. The target peptide includes an 

15 amino acid sequence corresponding to a peptide selected 
from the group consisting of GRF(1-41) (SEQ ID NO:23), 
GLPK7-34) (SEQ ID N0:21) and PTH(l-34) (SEQ ID NO:22). 



Brief Description of the Figures 

20 FIG. 1 depicts the various formulas for variable 

fused polypeptides formed of multiple units of target 
peptides. 



Detailed Description of the Invention 

25 The expression of a foreign protein by interaction 

of recombinant vectors with the biosynthetic machinery 
of host cells or host organisms is a well-known 
technique for biochemical protein synthesis. The 
present invention utilizes a novel modification of this 

30 expression technique in combination with a carbonic 

anhydrase -based fusion protein construct to establish a 
new, low cost, highly efficient method for the large 
scale biological synthesis of peptides. The method 
makes it possible to design processes for the isolation 

3 5 and/or purification of a recombinant peptide based 
solely on chemical methods, i.e. without requiring 
affinity chromatography or other chromatographic steps. 
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The fusion protein construct also permits the use of a 
ligand immobilized affinity separation technique if 
desired. The method eliminates expensive machinery and 
reagents, long synthetic times, low reaction efficiency 
5 and produces fewer faulty copies and a higher yield of 
the product peptide, than the solid phase peptide 
synthesis. 

The method of the invention incorporates a 
combination of cellular factors and biologic 

10 compositions that enable ready expression of foreign 
proteins by biological systems. Because the expressed 
fusion protein construct incorporates a binding protein, 
carbonic anhydrase, subversion of the particular 
identity of the target peptide by the expression 

15 mechanism of the cell or organism is not permitted. 
Variation in expression efficiency is minimized. The 
preferred inductive expression mechanism allows 
production of. peptide constructs that might otherwise be 
toxic to host cells. 

20 The method of the invention also incorporates 

factors that contribute substantially to the efficiency, 
capacity and yield of the purification technique. The 
solubility properties of the carbonic anhydrase and 
fusion construct including the carbonic anhydrase 

25 facilitate a clean, complete separation of the variable 
fused polypeptide from other constituents. The 
relatively low molecular weight of the carbonic 
anhydrase permits a high capacity and efficiency per 
unit weight of the fusion construct. Because the method 

3 0 does not require any special reagents or reactions, the 
production of undesirable side products is avoided. 

The size of the fusion protein construct will vary 
depending on the nature and number of copies of the 
target peptide. The fusion protein construct is large 

35 enough to avoid degradation by the host cell (e.g., at 
least about 60 to 80 amino acid residues) and not so 
large that it can not be effectively expressed by the 
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host cell. As a practical matter, the fusion protein 
construct will have a molecular weight of up to about 
500,000 although larger constructs are also within the 
scope of the present invention. The size of the fusion 
5 protein construct is chosen such that it may be 

expressed by the host cell so as to avoid introducing 
errors in the protein sequence. This places practical 
limitations on the number of copies of the target 
peptide present in a given construct. The actual number 
10 will vary depending on the size and nature of a 

particular target peptide within the limitations set by 
the factors discussed above. 

Carbonic Anhvdrase 
15 The carbonic anhydrase may be derived from a 

variety of sources. Suitable sources include 
vertebrates and in particular mammalian sources, e.g., a 
human carbonic anhydrase (hCA) , a rat carbonic 
anhydrase, a feline carbonic anhydrase, an equine 

2 0 carbonic anhydrase or a -bovine carbonic anhydrase. A 

carbonic anhydrase derived from other mammalian species 
such as ovine, murine, porcine and monkey, may also be 
used to form the fusion protein construct of the present 
invention. An example of a suitable carbonic anhydrase 

25 is human carbonic anhydrase II ("hCAII"; see, Taylor et 
al. f Biochemistry , 9, 2638 (1970)). 

The carbonic anhydrase may also be a modified 
functional version of a carbonic anhydrase. The 
modified functional version will retain a carbonic 

30 anhydrase' s capability of being precipitated as a 

function of solution conditions, e.g. at a particular pH 
or as a function of metal ion or salt concentration. 
Modified versions which can be precipitated from a wide 
variety of solutions by adjusting the pH of the solution 

3 5 to between about 3.2 and about 6.0, more preferably to 

between about 3.5 and about 5.5, are preferred. The 
modified functional version typically also retains the 
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ability of a carbonic anhydrase to strongly bind with 
the inhibitors, benzenesulf cnamide, acetazolamide, or 
derivatives thereof. A detailed discussion of the 
binding properties of carbonic anhydrases with respect 
5 to sulfanilamide and acetazolamide inhibitors is set 
forth in International Application No. PCT/US91/04511 , 
which is hereby incorporated by reference. 

Examples of suitably modified carbonic anhydrases 
include functional substitution mutants which (I) do not 
10 contain methionine, (II) have all or some glutamates 
replaced by another negatively charged amino acid, 
preferably aspartate, (III) have all or some arginines 
replaced by another positively charged amino acid, 
preferably lysine, (IV) have all or some asparagines 
15 replaced by another amino acid, preferably glutamine, 
(V) have- methionine replaced by another amino acid, 
preferably alanine, serine, cysteine, threonine or 
leucine, or (VI) have cysteine replaced by another amino 
acid, preferably serine, threonine, leucine or alanine. 
20 Examples of suitably modified versions of carbonic 

anhydrase also include polypeptides representing a 
functional fragment of a carbonic anhydrase such as 
hCAII, such as but not limited to hCAII C-terminated at 
cysteine 205, asparagine 231, methionine 240, or leucine 
25 250 or N-terminated at proline 21 or glycine 25. The 
functional fragment retains the ability of a carbonic 
anhydrase to be precipitated from a wide range of 
solutions by adjusting the pH. 

Other suitably modified versions include 
30 substitution mutants of hCAII and substitution mutants 
of the functional fragments of hCAII having the 
following substitutions: 

i. hCAII with all or some glutamate amino acid 
residues replaced by another negatively 
35 charged amino acid (AA) , preferably an 

aspartic acid residue, 
ii. hCAII with all or some arginine AA residues 
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preferably a lysine residue, 
iii. hCAII or a functional fragment thereof with 

one or more of the following modifications to 
5 the AA positions at [N11X # G12X] (asparagine 

glycine) , [N62X, G63X] , [N231X, G232X] , M240X 
(methionine) , or C205X (cysteine) modified as 
follows (where those AA positions are present 
in a given fragment) : the asparagine is 
10 changed to glut amine or glycine is changed to 

alanine, methionine is changed to alanine, 
serine, cysteine, threonine or leucine, and 
cysteine is changed to serine threonine, 
leucine or alanine. 
15 The carbonic anhydrase and fusion proteins 

including the carbonic anhydrase preferably are capable 
of being precipitated from a wide range of solutions by 
adjusting the pH of the solution to between about 3.2 
and about 6.0, preferably between about 3.5 and about 
2 0 5.5, and more preferably between about 4.0 and about 
5.0. Examples of solutions from which the carbonic 
anhydrase can be precipitated using this method include 
solutions containing polyethylene imine , urea (e.g., at 
least about 2M) or guanidine hydrochloride (e.g., at 
25 least about 2M) . Preferably, the carbonic anhydrase 
remains soluble in a wide variety of solutions outside 
of the pH range of about 3.2 to about 6.0. 

Variable Fused Peptide 

30 The variable fused polypeptide may have several 

forms (see e.g., Figure 1). One includes a single 
target peptide. A second is composed of multiple tandem 
units of a single target peptide that are linked by an 
amino acid or an intraconnecting peptide. In this case, 

35 the intraconnecting peptide usually but not necessarily 
differs in structure and selectivity from the 
interconnecting peptide (which links the carbonic 
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anhydrase and the variable fused polypeptide) . When 
different, the intraconnecting peptide nevertheless has 
the same general function as the interconnecting peptide 
so that two different cleavage agents of an enzymatic or 
5 chemical nature will separately cleave the variable 

fused polypeptide from the carbonic anhydrase and cleave 
the individual target peptides from each other. The 
third form is a single unit composed of several (i.e., 
two or more) identical or different target peptides 

10 tandemly interlinked together by innerconnecting 
peptides. The fourth form is composed of repeating 
multiple tandem units linked together by intraconnecting 
peptides wherein each unit contains the same series of 
different individual target peptides joined together by 

15 innerconnecting peptides. The fifth form is composed of 
a series, of tandem units linked together by 
intraconnecting peptides wherein each unit contains 
several identical or different target peptides joined by 
innerconnecting peptides and the target peptides do not 

20 repeat from unit to unit. The sixth form is composed of 
identical multiple tandem units wherein each unit 
contains several identical target peptides joined by 
innerconnecting peptides. 

The target peptide may incorporate all or a portion 

25 of any natural or synthetic peptide desired as a 

product, e.g., any desired protein, oligopeptide or 
small molecular weight peptide. For the purposes of 
this application a peptide includes at least two amino 
acid residues linked by a peptide bond. Suitable 

3 0 embodiments of the target peptide which may appear as 
single or multiple linked units in the variable fused 
polypeptide include caltrin, calcitonin, insulin, tissue 
plasminogen activator, growth hormone, growth factors, 
growth hormone releasing factors, erythropoietin, 

3 5 interferons, interleukins , oxytocin, vasopressin, ACTH, 
collagen binding protein, parathyroid hormone, glucagon 
like peptide, glucagon, proinsulin, tumor necrosis 
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factor, substance P, brain naturetic peptide, individual 
heavy and light antibody chains, individual antibody 
chain fragments especially such as the isolated variable 
regions (VH or VL) as characterized by Lerner, Science , 
5 246, 1275 et . seq. (Dec. 1989) and epitopal regions such 
as those characterized by E. Ward et al., Nature, 341 , 
544-546 (1986) wherein the antibodies, chains, fragments 
and regions have natural or immunogenetically developed 
antigenicity toward antigenic substances. Additional 

10 embodiments of the desired polypeptide include 

polypeptides having physiologic properties, such as 
sweetening peptides, mood altering polypeptides, nerve 
growth factors, regulatory proteins, functional 
hormones, enzymes, DNA polymerases, DNA modification 

15 enzymes, structural polypeptides, neuropeptides, 

polypeptides exhibiting effects upon the cardiovascular, 
respiratory, excretory, lymphatic, immune, blood, 
reproductive, cell stimulatory and physiologic 
functional systems, leukemia inhibitor factors, 

20 antibiotic and bacteriostatic peptides (such as 
cecropins, attacins, apidaecins) , insect icidal , 
herbicidal and fungicidal peptides as well as lysozymes. 

Among suitable variable fused polypeptides are 
peptides which include an amino acid sequence having the 

25 formula: 

-TargP-(CS2)- [ - (Lnl) n - (CS1) m -TargP- (CS2 ) - ] r 
wherein the -CS1- and -CS2- are cleavage sites, the 
-(Lnl)- is a linking sequence, the -TargP- is the target 
peptide, n and m are 0 or 1, and r is an integer from 1 

30 to about 150. The target peptide may include an amino 
acid sequence corresponding to GRF(1-41) (SEQ ID NO:23), 
GLPK7-34) (SEQ ID N0:21) or PTH(l-34) (SEQ ID NO:22). 
The - (CS1) - and - (CS2) - cleavage sites may be a chemical 
cleavage site or an enzymatic cleavage site. The 

3 5 enzymatic cleavage site may be recognized by an 

endopeptidase or by an exopeptidase (e.g., where r is 1 
and the - (CS2 ) - is the C-terminal residue of the 



WO 96/17942 PCT/US95/15800 

11 

construct) . Preferably, the enzymatic cleavage site is 
recognized by an endopeptidase . 



Cleavage Site 

5 The fusion protein construct may include a chemical 

cleavage site or an enzymatic cleavage site. The 
cleavage site or sites which may be incorporated into 
the fusion protein construct will depend upon the 
identity of the target peptide (s) present. The cleavage 

10 site and target peptide are typically selected so that 
target peptide does not contain an amino acid sequence 
corresponding to the cleavage site. Secondary 
considerations will also influence the choice of a 
particular cleavage site. In some instances, the 

15 cleavage sites may be designed so as to avoid the use of 
a enzymatic cleavage reaction. This may be accomplished 
by employing a chemical cleavage site, such as a site 
which may by cleaved after treament with an S- 
cyanylating agent (e.g, 2-nitro-5-thiocyanatobenzoate) 

20 or by treatment with an acid having a pK a of no more than 
about 3.0. In other instances, it may be desireable to 
employ a cleavage site which permits a modification of 
the target peptide to introduce a. specific functional 
group, e.g., a C- terminal a-carboxamide group. 

25 Chemical and enzymatic cleavage sites and the 

corresponding agents used to effect cleavage of a 
peptide bond close to one of these sites are described 
in detail in International Application Nos . 
PCT/US91/04511 and PCT/US94/08125 , the disclosure of 

3 0 which is herein incorporated by reference. Examples of 
peptide sequences (and DNA gene sequences coding 
therefor) suitable for use as cleavage sites in the 
present invention and their corresponding cleavage 
enzymes or chemical cleavage conditions are shown in 

3 5 Table 1 below. The gene sequence indicated is one 
possibility coding for the corresponding peptide 
sequence . Other DNA sequences may be constructed to 
code for the same peptide sequence. 
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Table 1 



Enzymes 
5 for Cleavage 

Enterokinase 



10 Factor Xa 



Thrombin 



15 



20 



Ubiquitin Cleaving 
Enzyme 

Renin 



Trypsin 
25 Chymotrypsin 

Clostripain 
3 0 S. aureus V8 



Peptide 
Sequence 

(Asp) 4 Lys 
(SEQ ID NO: 2) 

IleGluGlyArg 
(SEQ ID NO: 4) 

GlyProArg or 
GlyAlaArg 

ArgGlyGly 



HisProPheHisLeu- 
LeuValTyr 
(SEQ ID NO: 6) 

Lys or Arg 

Phe or Tyr or Trp 

Arg 

Glu 



DNA Sequence 

GACGACGACGATAAA 
(SEQ ID NO:l) 

ATTGAAGGAAGA 
(SEQ ID NO: 3) 

GGACCAAGA or 
GGAGCGAGA 

AGAGGAGGA 



CATCCTTTTCATC- 
TGCTGGTTTAT 
(SEQ ID NO: 5) 

AAA OR CGT 

TTT or TAT or 
TGG 

CGT 
GAA 



Chemical 
35 Cleavage 

(at pH3) 



40 



45 



( Hydr oxy 1 amine ) 
(CNBr) 

BNPS-skatole 

2-Nitro-5- 
thiocyanatobenzoate 



Peptide 
Sequence 

AspGly or AspPro 



AsnGly 

Methionine 

Trp 

Cys 



DNA Seaence 

GATGGA or 
GATCCA 

AATCCA 

ATG 

TGG 

TGT 
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Production of Recombinant Peptides 

Host cells transformed with an expression vector 
carrying a recombinant fusion protein construct gene may 
be employed to express the fusion protein construct. 
5 The host cell may be a prokaryotic or eukaryotic host 
cell or a cell in a higher organism. In one preferred 
embodiment of the invention, the host cell is a 
microbial host cell such as E. coli. The vector 
carrying the protein purification construct gene may be 

10 prepared by insertion of the DNA segments coding for the 
fusion protein construct into an appropriate base 
vector. Methods for expression of single- and multicopy 
recombinant fusion protein products are disclosed in 
International Application No. PCT/US91/04511 , the 

15 disclosure of which is incorporated herein by reference. 

The expression vector incorporates the recombinant 
gene and base vector segments such as the appropriate 
regulatory DNA sequences for. transcription, translation, 
phenotyping, temporal or other control of expression, 

2 0 RNA binding and post-expression manipulation of the 

expressed product. Structural features such as a 
promoter, an operator, a regulatory sequence and a 
transcription termination signal are generally included 
in the expression vector. The expression vector may be 

25 synthesized from any base vector that is compatible with 
the host cell or higher organism and provides the 
foregoing features. The regulatory sequences of the 
expression vector will be specifically compatible or 
adapted in some fashion to be compatible with 

30 prokaryotic or eukaryotic host cells or higher 

organisms. Post-expression regulatory sequences, which 
cause secretion of the polypeptide construct can be 
included in the eukaryotic expression vector. It is 
especially preferred that the expression vector exhibit 

3 5 a stimulatory effect upon the host cell or higher 

organism such that the polypeptide construct is 
overproduced relative to the usual biosynthetic 
expression of the host. 
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Typically, the recombinant gene is inserted into an 
appropriate base vector which is used to transform host 
cells or higher organisms with the resulting recombinant 
vector. The fusion protein construct may be expressed 
5 within the host cell or higher organism as a soluble 
product or as a product that is insoluble in the cell. 
Alternatively, the fusion protein construct may be 
expressed as a secreted product by the host cell or 
higher organism. Preferably, the fusion protein 

10 construct may be expressed in a host cell such as E. 

coli as a part of an inclusion body, i.e., an aggregate 
of insoluble material . Fusion protein constructs which 
are expressed as part of an inclusion body typically 
include two or more copies of a target peptide. It has 

15 been found that most single copy fusion protein 

constructs based on a carbonic anhydrase are expressed 
as a soluble product. For example, single copy fusion 
protein constructs which include hCAII and a single copy 
of a target peptide, such as calcitonin, substance P, 

20 angiotension, ATPase-IP, £SU ATPase, and asparagine 

synthetase, are expressed as soluble cell products in E. 
coli. Surprisingly, it has been discovered that in some 
instances, however, a single copy carbonic anhydrase - 
based fusion protein may be expressed as- part of an 

25 inclusion body. In particular, fusion proteins which 
include hCAII and a single copy of GRF(1-41) (SEQ ID 
NO:23), GLPK7-34) (SEQ ID NO:21) or PTH(l-34) (SEQ ID 
NO: 22) are expressed as a part of an inclusion body in 
an E. coli host cell such as E. coli BL21 F" ompT r" B m" B 

30 (DE3). 

The present invention provides a method of 
producing a recombinant peptide which includes 
precipitating either the fusion protein construct or a 
fragment of the fusion protein construct including the 
35 carbonic anhydrase (collectively a "CA-based protein") . 
hCAII is known to precipitate when a solution at about 
neutral pH of the enzyme in its native form is 
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acidified. It is believed that lowering the pH leads to 
a zinc cation being stripped out of the native enzyme 
and substantial conformational changes resulting in its 
precipitation. Surprisingly, it has been found that 
5 carbonic anhydrase and, in particular, a mammalian 
carbonic anhydrase such as hCAII, exhibits this same 
solubility behavior as a function of pH regardless of 
whether the enzyme is in its native form. Moreover, it 
has been discovered that fusion proteins incorporating a 

10 carbonic anhydrase may be precipitated in a similar 
fashion as a function of pH. A CA-based protein 
typically may be precipitated from a solution simply by 
adjusting the pH to between about 3.2 and about 6.0, 
preferably between about 3.5 and about 5.5, and more 

15 preferably between about 4.0 and about 5.0. CA-based 
proteins- may also be precipitated from solutions having 
a high salt concentration, e.g., from solutions which 
include at least about 5% (w/v) sodium sulfate or at 
least about 5% (w/v) ammonium sulfate. 

20 In one embodiment of the invention, a recombinant 

peptide is isolated by a process which includes 
precipitating the fusion protein construct. The 
precipitation step, may be carried out with a fusion 
protein construct which has been expressed as a soluble 

25 product or with a fusion protein construct which has 
been expressed as a part of an inclusion body. In the 
latter instance, the inclusion body is dissolved prior 
to the precipitation step, e.g., in a solution which 
includes at least about 0 . 5M citric acid. The fusion 

30 protein construct may also be dissolved in a variety of 
other solutions, such as a solution containing a 
chaotropic agent (e.g., guanidine hydrochloride) or a 
detergent (e.g., SDS) , a solution containing at least 
about 2 M urea or a solution having a pH of at least 

35 about 10.0. 

The fusion protein construct may be precipitated 
from the above solutions by adjusting the pH of the 
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solution to between about 3.2 and about 6.0 and 
preferably to between about 3.5 and about 5.5, The 
efficiency of the precipitation may be enhanced in some 
cases by adding a salt to the solution in addition to 
5 adjusting the pH. Suitable salts include sodium 

chloride, potassium chloride, manganese sulfate, sodium 
sulfate, sodium acetate, and lanthanum chloride. The 
addition of divalent metal ions, such as Mg 2 \ Ca 2 \ Zn 2 \ 
Sr 2+ or Co 2+ together with the pH adjustment may also 

10 enhance the efficiency of the precipitation. If the 
solubilized fusion protein construct is precipitated 
from an alkaline solution, the pH of the solution is 
typically adjusted by adding an acid such as acetic 
acid, formic acid, citric acid, phosphoric acid, and 

15 hydrochloric acid. As an alternative to precipitation 
based on pH adjustment, the solubilized fusion protein 
construct may be also precipitated from solution simply 
by the addition of a sufficient amount of a salt such as 
sodium sulfate or ammonium sulfate. 

20 In some instances, it may be useful to remove cell 

debris and nucleic acid material from a crude cell 
lysate as an initial step in the isolation of a 
recombinant peptide. This is , typically accomplished by 
adding at least about 0.25% and preferably at least 

25 about 0.38% polyethyleneimine (PEI) to the lysate. The 
addition of the PEI generally causes cell debris and 
about 90-95% of the DNA present to be precipitated. The 
mCA-based fusion protein may then be precipitated from 
the supernatant simply by adjusting the pH of the PEI 

30 solution to between about 3.5 and about 6.0. 

The precipitated fusion protein may be 
resolubilized, e.g., by dissolution in a solution having 
a pH of at least about 10, and preferably at least about 
10.5. The precipitated fusion protein may also be 
3 5 redissolved in other types of solutions, such as 

solutions which include a chaotropic agent, a detergent 
or at least about 0.5 M citric acid. After cleavage, 



WO 96/17542 



PCT/US95/158(K) 



17 

the fragment containing the carbonic anhydrase 
("carbonic anhydrase fragment") may be removed by 
chromatography, filtration, or preferably, by 
precipitation. As noted above, the carbonic anhydrase 
5 fragment may be precipitated simply by again adjusting 
the pH of the solution to between about 3.5 and about 
6.0. If desired, as an alternative, the carbonic 
anhydrase fragment may be separated from the cleavage 
products using a ligand immobilized affinity separation 

10 technique. For example, the carbonic anhydrase 

fragment, in native or renatured form, may be removed by 
contacting a solution of the cleavage products with an 
inhibitor (e.g., an affinity column) which includes 
benzenesulf onamide compound or an acetazolamide 

15 compound. Preferably the benzenesulf onamide compound 
includes a sulfanilamide compound such as an amino 
substituted benzenesulf onamide (e.g., 4-arnino- 
benzenesulf onamide) or a derivative thereof. The 
acetazolamide compound is 5-acetamido-l, 3 , 4-thiadiazol- 

20 2-sulf onamide or a derivative thereof. 

In another embodiment of the invention, the method 
of producing a recombinant peptide includes (i) cleaving 
the fusion protein construct to produce a soluble 
carbonic anhydrase fragment and a soluble variable fused 

2 5 polypeptide fragment, and (ii) precipitating the 

carbonic anhydrase fragment. The carbonic anhydrase 
fragment may include other amino acids residues in 
addition to the carbonic anhydrase. These other amino 
acid residues typically are derived from an 

3 0 interconnecting peptide present in the fusion protein 

construct. Suitable interconnecting peptides for use in 
the present invention include amino acid sequences 
containing a chemical or enzymatic cleavage site. After 
cleavage, the carbonic anhydrase fragment typically 
35 includes at least a portion of the interconnecting 
peptide or a derivative thereof. The derivative 
generally is produced as a byproduct of the cleavage 
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reaction. For example, a carbonic anhydrase fragment 
having a C-terminal homoserine residue may be created by 
cleavage of a fusion protein construct at the C-terminal 
peptide bond of a methionine residue . 
5 The supernatant fraction remaining after the 

precipitation of the carbonic anhydrase fragment may be 
isolated by centrifugation. The supernatant fraction 
contains the variable fused polypeptide fragment and, if 
desired, the variable fused polypeptide fragment 

10 purified further by conventional methods used for the 
isolation of peptides. 

To enhance the recovery of the variable fused 
polypeptide fragment, the precipitated carbonic 
anhydrase fragment may be extracted with a solvent which 

15 includes an organic solvent. Suitable solvents include 
an 'organic component such as acetonitrile, propenol, 
citric acid, polyethyleneglycol, or mixtures thereof. 
For example, aqueous solutions including an organic 
solvent, e.g., 50% aqueous acetonitrile and 50% aqueous 

2 0 n-propanol, may be used to carry out the extraction. 

Prior to being subjected to the cleavage reaction, 
the fusion protein construct may be purified by a 
variety of methods. For example, the fusion protein 
construct may be purified using a ligand immobilized 

25 affinity separation technique or by precipitation. 
Purification of the fusion protein construct by 
precipitation is preferred as it allows the recombinant 
fusion protein construct to be produced by a process 
which relies solely on wet chemical separation processes 

30 such as precipitation, filtration or centrif ugation 
without the need for renaturation or affinity 
purification. 

A third embodiment of the invention provides a 
method of producing a recombinant peptide which includes 

35 expressing the fusion protein construct as a part of an 
inclusion body in an E. coli host cell, and isolating 
the fusion protein construct. The fusion protein 
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construct typically includes multiple copies of a target 
peptide. As noted above however, it is possible to 
express fusion protein constructs which include an mCA 
and a single copy of an amino acid sequence 

5 corresponding to GRFd-41) (SEQ ID NO:23), GLPl(7-34) 
(SEQ ID NO: 21) and PTH(l-34) (SEQ ID NO:22). Examples 
of suitable single copy fusion proteins which may be 
expressed as inclusion bodies in an E. coli host cell 
are inclusion fusion proteins which include hCAII and a 

10 single copy of any one of GRF(1-41) (SEQ ID NO:23) , 
GRF(l-44) (SEQ ID NO:20), GLPK1-34) (SEQ ID NO:26), 
GLPK7-34) (SEQ ID NO:21), GLPK7-36) (SEQ ID NO:27), 
GLPK7-37) (SEQ ID NO:28), PTH(l-34) (SEQ ID NO:22), 
PTH(l-38> (SEQ ID NO:29) and PTH(l-84) (SEQ ID NO:25). 

15 The inclusion bodies may be isolated from a crude 

cell lysate by conventional techniques, e.g., by 
centrifugation. The crude inclusion bodies may be 
subjected to an initial purification step such as 
washing the inclusion bodies with a 50mM Tris, ImM EDTA, 

20 pH 7.8 solution or a lOOmM EDTA solution to remove any 
cell debris and/or nucleic acid materials present. 

The inclusion bodies may be dissolved under a 
variety of conditions. For example, the inclusion 
bodies may be dissolved under acidic conditions in a 

25 .solution having a pH of no more than about 3.2 (e.g., in 
a solution which includes at least about 500mM citric 
acid) . Alternatively, the inclusion bodies may be 
dissolved in a solution which has a pH of at least about 
10. The dissolution of the inclusion bodies under basic 

30 conditions may be facilitated by the addition of a small 
amount of a surfactant, such as N-lauroyl sarcosine, to 
the solution. The inclusion bodies may also be 
dissolved in a solution which includes an effective 
amount of a chaotropic agent such as 6N HCl or urea, or 

35 a detergent such as N-lauroyl sarcosine, sodium dodecyl 
sulfate, sodium octyl sulfate or cetyl trimethyl 
ammonium bromide (CTAB) . It has unexpectedly been 
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discovered that after the inclusion bodies have been 
" dissolved in the solutions described above, the 
solubilized fusion protein construct may be precipitated 
by adjusting the pH of the solution to between about 3.5 
5 and about 6.0, and preferably to between about 4.0 and 
about 5.0. As an alternative, the solubilized fusion 
protein construct may be isolated by a process which 
includes a ligand immobilized affinity separation 
technique . 

10 Another embodiment of the invention includes 

cleaving the fusion protein construct to produce a 
carbonic anhydrase fragment and a variable fused 
polypeptide fragment and precipitating both the carbonic 
anhydrase fragment and the variable fused polypeptide 

15 fragment. The precipitated fragments may be extacted 
with a solvent which includes an organic component to 
recover the variable fused polypeptide fragment . 
Suitable solvents include an organic component such as 
acetonitrile, propanol, citric acid, polyethyleneglycol , 

20 or mixtures thereof. 

The invention will be further described by 
reference to the following detailed examples. 

Example 1. Description of the Expression System 

25 Origin, phenotvpe and genotype of the host cells. 

The bacterial host for expression, E. coli BL21 F 
ompT r' B m' B (DE3) was obtained from Novagen, Inc., Madison, 
WI. These E. coli cells gave high levels of expression 
of genes cloned into expression vectors containing the 

3 0 bacteriophage T7 promoter. Bacteriophage (DE3) which 
contains the T7 RNA polymerase gene has been integrated 
into the chromosomal DNA of the BL21 (DE3) cells. The 
T7 RNA polymerase gene is controlled by the lacUV5 
promoter and the lad gene product. 

35 

Construction of pBNl 

An expression vector, pET31FlmhCAII containing the 
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hCAII gene was obtained from Dr. P.J. Laipis at the 
University of Florida. The pET31FlmhCAII was prepared 
as described by Tanhauser et al., Gene , 117 , 113 (1992). 
Plasmid pET31FlmhCAII contains the coding region for 
5 hCAII (human carbonic anhydrase II) downstream of a 
bacteriophage T7 promoter in a pUC-derived plasmid 
backbone. Two synthetic oligonucleotides, 5 ' -A GCT TTC 
GTT GAC GAC GAC GAT ATC TT-3' (SEQ ID N0:7) and its 
complementary sequence 5'-AGC TAA GAT ATC GTC GTC GTC 

10 AAC GAA- 3 ' (SEQ ID NO : 8 ) , were cloned into pET31FlmhCA2 
which had been digested with Hind III. This plasmid was 
designated pAl. 

Plasmid pAl was digested with the restriction 
endonucleases Ssp I and BspE I and the resulting ends 

15 were made blunt by treatment with T4 DNA polymerase. 

The DNA * fragment from the pAl digest containing the T7- 
hCAII-cassette was subcloned into the Sea I restriction 
site of pBR322 (New England Biolabs) thus conferring 
tetracycline resistance, but not ampicillin resistance. 

20 The resulting plasmid was designated pBNl . 

Construction of pBN4 

The pAl plasmid was opened at the Hind III site and 
the EcoR V site and the synthetic oligonucleotide, 5 '-A 

2 5 GCT GAA TTC AAC GTT CTC GAG GAT -3' (SEQ ID NO: 9) and 
its complementary sequence 5 ' -ATC CTC GAG AAC GTT GAA 
TTC-3' (SEQ ID NO: 10), were cloned into the vector. The 
insertion of these oligonucleotides provides a T7-hCAII- 
cassette containing unique EcoR I and Xho I restriction 

30 sites at the carboxyl terminal of hCAII. The resulting 
plasmid was designated pA3 . 

The pBNl vector was digested with EcoR I and the 
single stranded overhangs were filled in with Polymerase 
I Large (Klencw) Fragment. The linear plasmid with 

35 newly formed blunt ends was religated, thus destroying 
the EcoR I site. The resulting plasmid was designated 
pBN3 . 
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Plasmid pA3 was digested with the restriction 
endcnucleases, Xba I and BspE I. The DNA fragment from 
the pA3 digest containing the T7-hCAI I -cassette was 
subcloned into the pBNl vector which had been digested 
5 with Xba I and BspE I. The resulting vector was 
designated plasmid pBN4 . 

Example 2. Inoculum Preparation for hCA-Fusion 
Constructs 

10 L-broth was sterilized in the autoclave at 121°C 

for 20 minutes on the liquid cycle setting. The glucose 
and tetracycline stocks were filter sterilized by 
passage of the solution through a 0.22 fim filter. Two 
250 ml shake flasks were each charged with the following 
15 solutions: 

50 ml L-broth (1.0 % tryptone, 1.0% NaCl, 0.5% 
yeast extract) 

1.0 ml glucose stock (50 mg/ml) 
150 ill tetracycline stock (5.0 mg/ml) 
20 100 jil thawed inoculum of E. coli cells transfected 

with a vector coding for the desired hCA- fusion 
construct 

The shake flasks were placed in an incubator shaker at 
37°C / 200-220 rpm for 10-14 hours. The optical density 
25 (O.D.) of the cells in the resulting solutions was then 
measured at 540 nm. A 1:25 dilution was usually 
necessary to obtain a proper reading. One of the two 
shake flasks was then chosen for inoculating the next 
set of shake flasks. Three 500 ml shake flasks were 
30 each charged with the following sterilized solutions: 
200 ml L-broth (1.0 % tryptone, 1.0% NaCl, 0.5% 
yeast extract) ; 

4.0 ml glucose stock (50 mg/ml); 
600 pi tetracycline stock (5.0 mg/ml); 
35 1.0 ml inoculum from one of the first two shake 

flasks . 
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The three shake flasks were placed in an incubator 
shaker under the conditions described above and the 
cells were allowed to grow for 8-10 hours. The optical 
density of the resulting solutions was then measured at 
5 540 nm (typically at a 1:25 dilution) . All three shake 
flasks were then used to inoculate the fermentor. 

gyftTn plg 3. Fermentation of hCA-Fusion Construct on 

60 L Scale 

10 Fermentation media was added to the fermentor and 

the volume was adjusted to 45.0 L with distilled H 2 0. 
The media contained the following: 12 00.0 g Case amino 
acids; 3 00.0 g Yeast extract; 30.0 g NaCl; and 0.10 ml 
Antifoam. The fermentor was ■ sterilized at 121°C for 25 

15 minutes. The fermentor was cooled to 37°C. Before 

inoculation, the following solutions were added to the 
fermentor: 

glucose (480.0 g in 800.0 ml H 2 0) 
magnesium (120.0 g MgS0 4 -HO in 250.0 ml H 2 0) 
20 phosphates (120.0 g K 2 HP0 4 & 465. 0 .g KH 2 P04 in 

3.0 L H 2 0) 

tetracycline (0.90 g tetracycline -HC1 in 30.0 
ml 95% EtOH & 20.0 ml H 2 0) 

mineral mix (Dissolved in 490.0 ml H 2 0 & 10.0 
25 ml concentrated HCl) : 

3 . 6 g FeS0 4 ■ 7H 2 0 
3.6 g CaCl 2 -2H 2 0 
0.90 g MnS0 4 
0.90 g AlCl 3 *6H 2 0 
30 0.09 g CuCl 2 -2H 2 0 

0.18 g Molybdic Acid 
0.36 g CoCl 2 '6H 2 0 
All of the above solutions were sterilized for 20 
minutes in the liquid cycle in an autoclave except for 
35 the tetracycline and mineral mix solutions. These were 
sterilized by passage through a .22 p filter. At this 
point, the pH typically had dropped to approximately 
6.5. If this had occurred, base (14.8 N Ammonium 
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hydroxide) was added to adjust the pH to 6.8. After the 
pH reached 6.8, 600.0' ml inoculum was added to the 
f ermentor . The following parameters were monitored at 
time zero and throughout the fermentation: Glucose 
5 concentration (maintained at about 2-5 g/L) ; Optical 

Density; pH (6.8 is near optimal); Dissolved Oxygen (40% 
is near optimal) ; and Agitation. The temperature was 
maintained at 37° C throughout the fermentation. Air 
intake was 40 L/min at the beginning of the 

10 fermentation. The initial dissolved oxygen 

concentration was 90% but quickly dropped to 40%. It 
was maintained at this level via increased agitation and 
oxygen supplementation throughout the fermentation. 
When oxygen supplementation was started, the air inlet 

15 was reduced to 20 L/min. The initial glucose 

concentration was approximately 9 g/L but dropped to 5 
g/L after about six hours. Once the glucose 
concentration dropped to this level, a glucose feed (70% 
w/v glucose) was used to maintain the glucose 

20 concentration at 5 g/L. 

When the fermentation had proceeded to the point 
where an O.D. of 15-20 was measured, the media feed was 
started. The media feed consisted of 1200.0 g case 
amino acids and 300.0 g yeast extract dissolved in 5 . 0 L 

25 distilled H 2 0 and sterilized for 20 minutes on liquid 
cycle. The media feed was added to the f ermentor over 
1.0-1.5 hours. When fermentation had produced an O.D. 
of 30.0, the fermentation was induced by adding the 
following solutions to the f ermentor: 

30 isopropylthiogalactoside (IPTG; 28.8 g in 200 ml 

distilled H 2 0) ; ZnCl 2 (O.B18g in 50 ml distilled H 2 0 with 
one drop of 6N HCl) . The IPTG solution was filter 
sterilized through 0.22 /im filter. The ZnCl 2 solution 
was sterilized for 20 minutes using the liquid cycle in 

35 the autoclave. After the addition, the concentration of 
IPTG in the f ermentor was 2.0 mM and the concentration 
of ZnCl 2 was 100 jiM. A feed of a mixture of amino acids 
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was- then started at this point. The amino acid feed 
consisted of 225.0 g L- serine; 75.0 g L-tyrosine; 74.0 g 
L- tryptophan; 75.0 g L-phenylalanine ; 75.0 g L-proline; 
and 75.0 g L-histidine; and was dissolved in a mixture 
5 of 1.5 L H 2 0 and 500 ml concentrated HC1 . The amino acid 
feed was sterile filtered through a 0.22 /im filter prior 
to addition to the fermentation. Induction was allowed 
to continue for 2,0 hours at which point the 
fermentation broth was transferred to a harvest tank and 
10 chilled to approximately 5-10°C. The fermentation 
typically yielded between 6.5 to 9.5 kg of wet cell 
paste (dry cell weight of about 1.0-1.5 kg). 

Exar " pl-a 4. Harvest of 60 L Ferme ntation 

15 The cell suspension from the fermentor as desribed 

above was concentrated over a tangential crossflow 
membrane to a volume of 10 L. The concentrated cell 
suspension was diafiltered and washed with 30 L of a 
cold wash buffer containing 50 mM Tris-S04 pH 7.8, 1.0 

20 mM EDTA, and 0.10 mM phenylmethylsulf onyl fluoride 

(PMSF) . The cell suspension was then concentrated to 8 
L. The concentration and washing of the cell suspension 
typically required 6-10 hours. At this point, the 
concentrated cell suspension (cell paste) may be bagged 

25 and frozen for later processing or transferred to 
homogenizer holding tanks for cell lysis. 

Exam ple 5. Cell Lysis 

The cell paste obtained from the 60 L fermentor was 
30 diluted to 32 L in cold wash buffer (see above) and the 
resulting cell suspension was chilled to 5-10°C. The 
chilled cell suspension was homogenized at 12,000 psi 
with a Galin high pressure homogenizer. The homogenized 
cell paste was passed through a heat exchanger to chill 
35 the lysate to 10°C and passed through the homogenizer a 
second time. 
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Example 6. Preparation of Thrombin Stock Solution 

The thrombin used in fusion protein cleavage 
.reactions was obtained from Calbiochem in the form of a 
lyophilized powder. This powder was solubilized prior 
5 to use by dissolution in MilliQ water to a concentration 
of 1 mg/ml as determined by specific activity. 

Example 7. Production of PTH(l-34) (SEQ ID NOs22) 

The preparation of the DNA segment coding for a 

10 single copy PTH(l-34) (SEQ ID NO: 22) fusion protein was 
carried out by preparing an expression vector coding for 
a PTH- construct which included a DNA segment coding for 
the hCAII, an interlinking peptide, and PTKU-34) (SEQ 
ID NO: 22) . The following oligos were obtained from 

15 Operon Technologies Inc, 1000 Atlantic Ave. Alameda CA 
945'01. 

Oligo 1: 5' CCC AAG CTT CTG TTC GTG GTC CGC GTT CTG 

TTT CTG AAA (SEQ ID NO: 11) 
Oligo 2: 5' GAA ACA GAA CGC GGA CCA CGA ACA GAA GCT 
20 TGG G (SEQ ID NO: 12) 

Oligo 3: 5' TCC AGC TGA TGC ACA ACC TGG GTA AAC.ACC 

TGA ACT (SEQ ID NO: 13) 
Oligo 4: 5' AGG TGT TTA CCC AGG TTG TGC ATC AGC TGG 
ATT TCA (SEQ ID NO: 14) 
2 5 Oligo 5: 5' CTA TGG AAC GTG TTG AAT GGC TGC GTA AAA 

AAC TGC A (SEQ ID NO:15) 
Oligo 6: 5' TTT TTT ACG CAG CCA TTC AAC ACG TTC CAT 

AGA GTT C (SEQ ID NO: 16) 
Oligo 7: 5' GGA CGT TCA CAA CTT CTA AGA TAT CCG G 
30 (SEQ ID N0:17) 

Oligo 8: 5' CCG GAT ATC TTA GAA GTT GTG AAC GTC CTG 
CAG (SEQ ID NO:18) 
The eight synthetic DNA oligos were obtained and 
the complementary strands phosphorylated and annealed. 
35 The four double stranded fragments were used to prepare' 
a DNA fragment coding for the interpeptide linker 
followed by PTH(l-34) (SEQ ID NO:22). This DNA fragment 
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was digested and inserted into pAl using the restriction 
sites Hind III and EcoR V creating the vector pAl:PTH(l- 
34) . 

The expression cassette was removed from pAl:PTH(l- 
5 34) by digestion with the restriction endonucleases Xba 
I and BspE I and inserted into the vector pBNl which had 
been digested with the same restriction endonucleces . 
This final vector was desiganted pBNl :PTH (1-34) . 
E. coli cells transformed with the vector 

10 pBNl:PTH(l-34) containing DNA coding for an hCA-PTH 
fusion protein were prepared, cultured and lysed 
according to the procedures described above. The hCA- 
PTH fusion protein included hCAII linked to an amino 
acid sequence corresponding to PTH(l-34) (SEQ ID NO: 22) 

15 through a thrombin cleavage site (Gly-Pro-Arg) 

immediately adjacent the N- terminus of the PTH(l-34) 
(SEQ ID NO: 22) sequence. The hCA-PTH fusion protein was 
expressed as inclusion bodies in E. coli BL21 F" ompT r' B rrf B 
(DE3) cells. 

20 

Inclusion Body Purification 

The cell lysate from the 60 L ferment ion run was 
centrifuged in a continuous flow rotor at 20,000 g at 
400 ml/min. The pellet contained 300-500 g of crude 

25 inclusion bodies. The crude hCA-PTH inclusion bodies 
was suspended in 2M citic acid at a concentration of 60 
mg solids/ml (4-8 liters) and sonicated at 70% power 
with a 50% pulse rate. The solubilized inclusion bodies 
were clarified by centrif ugation at 20,000 g. 

30 The hCA-PTH fusion protein in the supernatant was 

precipitated by the addition of 10 N NaOH until the pH 
reaches 4-5. The precipitated hCA-PTH fusion protein 
was collected by centrif ugation at 20,000 g. The 
precipitated hCA-PTH fusion protein was then washed with 

35 4 to 8 liters of 5% acetic acid/45% EtOH in Milli-Q 
water and the precipitated hCA-PTH fusion protein was 
again collected by centrif ugation. The hCA-PTH fusion 



WO 96/17942 PCT/US95/15800 

28 

protein was then washed twice with 4 to 8 liters of 100 
mM EDTA and once with 4-8 liters of Milli Q water and 
the protein collected by centrif ugation after each wash. 

5 Thrombin Cleavage of PTE (1-34) 

A solution containing 200 ml of 50mM NaOH and 0.25% 
N-lauroyl sarcosine is added to a bottle containing a 
pellet of the hCA-PTH fusion protein. The bottles are 
placed into 37°C water bath to warm pellets (typically 

10 for 2-5 minutes) to aid in the solubilization of the 
pellet. The material is homogenized until all large 
pieces are disaggregated. The pH is readjusted 11.6 to 
11.9 with a solution containing' 50mM NaOH and 0.25% N- 
lauroyl sarcosine. The resulting solution is sonicated 

15 until all of the hCA-PTH fusion protein pellet has 
dissolved. Typically sonication is for 2 minutes at 
power 10, pulser on, 70% duty cycle (smaller batches 
require less time) . The protein concentration of the 
reaction solution is determined by absorbance at 280 nm. 

20 Ideally the concentration should be 6-7 mg/ml (8-10 

liters for a 60 1 fermentation) . If the concentration 
is greater than 9 mg/ml, the solution is diluted to 6-7 
mg/ml with the 50mM NaOH/0.25% N-lauroyl sarcosine 
solution. 

25 The protein solution is stirred vigorously and the 

pH is adjust to 8-8.2 with 1M Tris-HCl. If solution is 
hazy, it is filtered through glass fiber filters. The 
solution is then filtered through 0.45 fim cellulose 
acetate membrane and the protein concentration is 

3 0 determined by measuring the absorbance at 280 nm. Solid 
NaCl is added to a final concentration of 250mM. A 
thrombin stock solution 1 mg/ml is added to produce a 
protein to enzyme ratio of 5000:1 w/w. The solution is 
then stirred in a water bath plate at 37°C. The 

3 5 reaction is monitored by HPLC on a C8 column eluted with 
a gradient from Buffer A (95% water, 5% acetonitrile 
(ACN) , 0.1% trifluoroacetic acid (TFA) ) to Buffer B (95% 
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ACN, 5% water, .1% TFA) . The reaction is stopped by the 
addition of PMSF. Typically the reaction is complete as 
determined by the loss of the peak corresponding to the 
starting material in 46-48 hours. The PMSF is dissolved 
5 into 95% EtOH, and added directly to the reaction 

mixture with vigorous stirring. The thrombin cleavage 
procedure yields 5.1 g of PTH(l-34) (SEQ ID NO: 22) from 
60 g of hCA-PTH fusion protein. 

10 Citric Acid Precipitation of Thrombin Cut PTE Material 

Both the resulting human carbonic anhydrase (from 
the cleavage of the hCA-PTH fusion protein) and 
remaining unhydrolized fusion protein are precipitated 
from the solution by adding sufficient citric acid to 

15 achieve 150mM, leaving the desired peptide (PTK 1-34) 
(SEQ ID -NO: 22) in solution. The precipitated material 
is removed by centrif ugation . The supernatant is 
filtered through 0.45 jim filter and frozen at -80°C or 
desalted directly on a C8 column. The yield from the 

20 peptide from the step is 85%. 

If desired the PTH(l-34) (SEQ ID NO:22) may be 
further purified by chromatography on preparative C8 
column using a gradient from 10-70% Buffer B of aqueous 
acetonitrile/ trif luoroacetic acid (ACN/TFA) buffers, 

25 where Buffer A: 0 . 1% T£& x . .95% water, 5% acetonitrile, 
Buffer B.:0.1% TFA, 5% water, 95% acetonitrile. 
Alternatively, the PTH(l-34) (SEQ ID NO:22) sample may 
be purified on the preparative C8 column eluting first 
with a gradient from 10-70% Buffer B; where Buffer A is 

30 5mM HC1, 95% water, 5% acetonitrile and Buffer B. The 
PTH(l-34) (SEQ ID NO: 22) is desalted with a gradient 
where Buffer A lOmM acetic acid, and Buffer B i^ 10 mM 
acetic acid in 50% aqueous ethanol . 

35 Example 8. Productinn of GLP1 (7-36) -NH, (SEQ ID NO: 27) 

Six synthetic DNA oligos (1-6) were phosphorylated 
and the complementary strands annealed. Oligopair 1&2 



WO 96/17942 



PCT/US95/15800 



30 

were inserted into pUC19 (commercially available from 

New England Biolabs. MA) between Hind III and Kpn I. 

Oligopair 3&4 was then inserted into this vector between 

Kpn I and EcoR I yielding a vector named pUC19:GLP(7- 

® 

5 22) . Oligopair 5&6 was ligated into pBluescript IISK+ 
(commercially available from Stratagene) between Kind 
III and EcoR I yielding the vector PB5 :GLP (21-34) AFA. 
The fragment containing oligopair 5&6 was inserted into 
pUC19:GLP(7-22) by digesting pUC19 :GLP (7-22) with Nar 1 

10 and EcoR I and digesting PB5 :GLP (21-34) AFA with Cla I 
and EcoR I and ligating the fragment from PB5:GLP(21- 
34) AFA containing oligos 5&6 into the truncated 
pUC19 :GLP(7-22) to yield pUC19 :GLP (7-34) AFA. 
pUC19 :GLP (7-34) AFA was digested with Hind III and the 

15 fragment containing the sequence coding for the peptide 
interlinker followed by GLP1 (7-34) -Ala-Phe-Ala (SEQ ID 
NO: 30) ("GLP1 (7-34) AFA M ) was transferred to pA4 to yield 
vector pA4 :GLP (7-34) AFA. The hCAII gene in pA4 had been 
mutated to change the methionine-240 to a cysteine. The 

20 expression cassette was then digested from pA4:GLP(7- 

34) AFA with the restriction endonucleases Xba I and BspE 
I and inserted into the vector pBNl digested with the 
same restriction endonucleases. The resulting vector 
was designated pBN6 :GLP (7-34 ) AFA. 

25 E. coli cells transformed with the vector 

pBN6 :GLP1 (7-34) AFA containing DNA coding for an hCA- 
GLP1 (7-34) -Ala-Phe-Ala ( " hCA-GLPl-AFA" ) fusion protein 
were prepared, cultured and lysed according to the 
procedures described above. The hCA-GLPl -AFA fusion 

30 protein included hCAII linked to an amino acid sequence 
corresponding to GLP1 (7-34) -Ala-Phe-Ala (SEQ ID NO:30) 
through a methionine residue located immediately 
adjacent the N-terminus of the GLP1 (7-34) -Ala-Phe-Ala 
(SEQ ID NO: 30) sequence. The hCA-GLPl-AFA fusion 

3 5 protein was expressed as inclusion bodies in E. coli 
BL21 F" ompT r* B m" B (DE3) cells . 
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Inclusion Body Purification 

The cell lysate was centrifuged in a continuous 
flow rotor at 20,000 g at 400 ml/min to produce a pellet 
containing 300-500 g of crude inclusion bodies. The 
5 crude hCA-GLPl-AFA inclusion bodies were suspended in 
lysis buffer at a concentration of 25% solids (1.2-2.0 
L) and homogenized in the Galin homogenizer at 13,000 
psi. The homogenized inclusion bodies were collected by 
centrifugation at 20,000 g, resuspended in a solution of 

10 100 mM Na Acetate pH 4.5 and homogenized a second time. 
The homogenized inclusion bodies were then suspended in 
distilled water. The resulting suspension is 
homogenized and the inclusion bodies collected by 
centrifugation. The purified inclusion bodies are 

15 dissolved in 1.5 M citric acid and clarified by 

centrifugation. The hCA - GLP 1 - AFA fusion protein was 
precipitated by the addition of NaOH until the pH is 4-5 
and isolated by centrifugation. 

20 Cyanogen Bromide Cleavage of hCA-GLPl-AFA 

The hCA - GLP 1 - AFA inclusion bodies were dissolved in 
a solution of 2 . 0 M Citric Acid at a concentration of 
about 35-45 mg/ml . The solution was adjusted pH to 1.0 
with cone. HCl and 0.2 g cyanogen bromide (CNBr) per g 

25 of total protein was added. The protein concentration 
was determined by HPLC. The protein solution was 
sparged with pure Argon before addition of the CNBr. 
The cleavage reaction was allowed to proceed under an 
Argon atmosphere for 4-5 hours at which time residual 

30 CNBr was neutralized by adding 2.2g D, L-methionine per g 
of CNBr added followed by stirring for 30 minutes. 

Purification of GLP1 (7-34) -AFA (SEQ ID NO: 30) 

The solution from the CNBr cleavage reaction was 
35 diluted 1:1 with a solution of 10% sodium sulfate (w/v) 
in water to precipitate all protein and peptide from the 
solution. The sample was centrifuged at 20,000 x g for 
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15 minutes and the supernatant decanted. Distilled 
water (200 ml) was added to each centrifuge vial. The 
resulting mixture was homogenized and centrifuged for 15 
min. at 20,000 x g. The supernatant was decanted and a 
5 volume of 50% acetonitrile (ACN) was added to the pellet 
material. The sample was homogenized, stirred for 10-15 
minutes, and finally centrifuged at 20,000 x rpm for 15 
minutes. The extraction with the 50% acetonitrile 
solution was repeated two to three times to achieve a 
10 90-95% recovery of GLP1 (7-34) AFA (SEQ ID NO: 30) in the 
acetontrile solution. The extraction may also be 
carried out with 50% n-propanol . 

Batch loading of 50% ACN extraction onto DOWEX-1 

15 DOWEX-1 (25 g for each 250 mg of GLP1 (7-34) -AFA 

(SEQ ID NO: 30) is prewashed with 10% ACN solution prior 
to being added to the 50% ACN extraction solution. The 
DOWEX-1 is loaded in a batch fashion and the suspension 
stirred for 20 minutes. The DOWEX-1 is then filtered 

20 away from the GLP1 (7-34 ) -AFA (SEQ ID NO:30) solution. 

Low Pressure C8 chromatography 

The 50% ACN solution is diluted to 12.5% ACN with 
distilled H 2 0 to facilitate the binding of GLP1 (7-34) -AFA 

25 (SEQ ID NO: 30) to the C8 resin. The peptide solution is 
loaded. onto the C8 resin in an low pressure column and 
the column is washed with 5 column volumes of 3 0% (v/v) 
ACN solution. The GLP1 (7-34) -AFA (SEQ ID NO:30) is then 
eluted with a solution of 50% ACN in distilled H 2 0. The 

30 50% ACN solution is lyophilized to obtain the GLPK7- 
34) -AFA (SEQ ID NO:30) . 

Transpeptidation of GLP1 (7-34) AFA (SEQ ID NO:30) 
into GLP1(7-36)NH 2 (SEQ ID NO:27) 

35 GLP1 (7-34) AFA (SEQ ID NO: 30) (1 mmole) is dissolved 

in a 50/50 (v/v) solution of DMF/H 2 0 which includes 5 mM 

CaCl 2 and 400 mM Gly-Arg-NH 2 and the solution pH is 

adjusted to 8 . 0 with 5 M NaOH. Two mg of TPCK-treated 
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trypsin is then added. The transpeptidation reaction 
which produce GLP1 (7-36 )NK 2 (SEQ ID NO: 27) is complete in 
2-3 hours at room temperature. 

If desired the GLP1 (7-36 ) -NH 2 (SEQ ID NO: 27) may be 
5 further purified by chromatography on preparative C8 
column using a gradient of aqueous 

acetonitrile/trifluoroacetic acid (ACN/TFA) buffers. 

Example 9. Production of GRF(1-44)NH 7 (SEP ID NO:20) 

10 Eight oligonucleotides containing segments of the 

linker and the peptide were phosphorylated and 
complementary oligonucleotide pairs 1&2, 3£4, 5&6, and 
7&8 were annealed. Oligonucleotide pairs 1&2 and 3&4 
were simultaneously ligated into the pTZ19R vector 

15 (commercially available from Pharmacia Biotech Inc., NJ) 
between - the Hind III and Sal I sites to yield pTZ:GRF(l- 
29) . Oligonucleotide pairs 5&6 and 7&8 were 
simultaneously ligated into a separate pTZ19R vector 
between the Sal I and EcoR I sites to yield pTZ:GRF(29- 

2 0 44) A. The gene fragments were cloned adjacent to each 
other in a single vector by digesting pTZ :GRF (1-29) and 
pTZ:GRF (29-44) A with the restriction endonucleases Xmn I 
and Sal I, isolating the 1.9 kb band from the pTZ:GRF(l- 
29) vector and the 0.9 kb band of the pTZ :GRF (29-44) A 

25 vector and ligating them together to yield the vector 
pTZ : GRF ( 1 - 4 4 ) A . 

The three Asn-Gly sites in hCAII (located at 
positions 10-11, 61-62 and 230-231) were changed to 
GlnlO-Glyll, Gln61-Gly62 and Asn230-Ala231 by site 

30 mutagenesis of specific codons in plasmid pAl . All 
three mutations were combined to create plasmid pA2 . 

The pA2 vector was digested with EcoR V. 
pTZ : GRF (1-44) A was digested with Dra I and EcoR V. The 
fragment containing the GRF gene and the linearized pA2 

35 plasmid were ligated together to yield pBN2 :GRF (1-44) A. 

E. coli cells transformed with vector pBN2:GRF(l- 
44) A containing DNA coding for an hCA-GRF fusion protein 
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were prepared, cultured and lysed according to the 
procedures described above. The hCA-GRF fusion protein 
included hCAII linked to an amino acid sequence 
corresponding to GRF (1-44) -Ala (SEQ ID N0:31) through an 
5 interconnecting peptide. The interconnecting peptide 
included a thrombin cleavage site and an enterokinase 
cleavage site. The enterokinase cleavage site (Asp-Asp- 
Asp-Asp-Lys) (SEQ ID NO: 2) was positioned immediately 
adjacent the N-terminus of the GRF (1-44) -Ala (SEQ ID 

10 NO: 31) sequence. The thrombin cleavage site (Gly-Pro- 
Arg) was located so that after treatment with thrombin, 
the hCA-GRF fusion protein produced a peptide fragment 
having an 8 amino acid sequence (Ala-Met-Val-Asp-Asp- 
Asp-Asp-Lys) (SEQ ID NO: 19) connected to the N-terminus 

15 of the GRF (1-44) -Ala (SEQ ID NO: 31) sequence. The hCA- 
GRF fusion protein was expressed as inclusion bodies in 
E. coli BL21 F" ompT r" B m" B (DE3) cells. 

Inclusion Body Purification 

20 The cell lysate was centrifuged in a continuous 

flow rotor at 20,000 g at 400 ml/min to produce a pellet 
containing 300-500 g of crude inclusion bodies. The 
crude hCA-GRF-Ala inclusion bodies were suspended in 2M 
citic acid at a concentration of 60 mg solids/ml (5-9 L) 

25 and sonicated at 70% power with a 50% pulse rate. The 
solubilized inclusion bodies were clarified by 
centrifugation at 20 # 000 g. The hCA-GRF-Ala fusion 
protein in the supernatant was precipitated by the 
addition of 10 N NaOH until the pH reaches 4-5. The 

3 0 precipitated hCA-GRF-Ala fusion protein was then 

collected by centrifugation at 2 0,000 g. After washing 
with 5% acetic acid/ 45% ethanol in Milli-Q H 2 0, the 
precipitated hCA-GRF-Ala fusion protein was again 
collected by centrifugation. The precipitated hCA-GRF- 

35 Ala fusion protein was then resuspended twice in 100 mM 
EDTA and the fusion protein was collected by 
centrifugation. The washed, precipitated hCA-GRF-Ala 
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fusion protein was suspended in distilled H 2 0. The 
suspension was homogenized and the hCA-GRF-Ala fusion 
protein was collected by centrifugation. 

5 Thrombin Cleavage of the hCA-GRF Fusion Protein 

A 200 ml portion of a solution containing 50mM NaOH 
and 0.25% N-lauroyl sarcosine was added to a bottle 
containing a pellet of the hCA-GRF-Ala fusion protein. 
The bottle was placed into 37 °C water bath to warm the 
10 pellet. The NaOH solution was swirled around in the 
bottle to remove all solid material from the bottle 
sides. The slurry was then poured into a suitable size 
beaker. The material was homogenized until all large . 
pieces are disaggregated and the pK was monitored. The 
15 pH was readjusted to between 11.6 and 11.9 with a 

solution of 50mM NaOH and 0.25% N-lauroyl sarcosine. 
The solution was sonicated until all of the inclusion 
body pellet has dissolved and solution is clear. 
Sonication times varied according to size of batch. 
20 Typically anything over 1 liter was sonicated for 2 
minutes, power 10, pulser on, 70% duty cycle (smaller 
batches required less time) . 

The protein concentration of the reaction solution 
was measured. If concentration was greater than 9 
25 mg/ml, the solution was diluted with the 50mM NaOH/0.25% 
N-lauroyl sarcosine solution to a protein concentration 
of 6-7 mg/ml. The protein solution was stirred 
vigorously, and the pK adjusted to 8-8.2 with 1M Tris- 
HC1. If the solution was slightly hazy at this point, 
30 the solution was clarified by filtering through glass 
fiber filters. The clarified solution was sterile 
filtered through a 0.45 /xm cellulose acetate membrane. 
The protein concentration was determined by measuring 
the absorbance at 280 nm. Sufficient NaCl to produce a 
35 250mM NaCl concentration was added with vigorous 
stirring. 

Sufficient thrombin stock solution (1 mg/ml) to 
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produce a protein to enzyme ratio of 5000:1 was added 
and the resulting mixture stirred in a water bath on top 
of stir plate at 37 °C. The reaction was monitored by 
HPLC on a C8 column eluted with a gradient of H 2 0/ACN/TFA 
5 buffers. The reaction was stopped by. the addition of 
PMSF to a final concentration of O.lraM when the fusion 
construct peak was essentially gone (usually 46-48 
hours) . The PMSF was dissolved into 95% EtOH and added 
directly to the reaction mixture with vigorous stirring. 

10 

Citric Acid Precipitation of the hCA- Fragment 

The human carbonic anhydrase fragment from the 
cleavage of the fusion protein and any residual uncut 
fusion protein were removed from solution by 

15 precipitation with 90mM citric acid, leaving the 

intermediate peptide, Ldr-GRF ( 1 -44 ) -Ala , in solution. 

The volume of the thrombin cut solution was 
measured in a graduated cylinder. A stock solution of 
1M citric acid was added to final concentration of 90mM 

20 citrate. The addition was made slowly while with 
vigorously stirring the solution. The aggregated 
material was centrifuged and the supernatant was 
filtered through a 0.4 5 /xm filter. The filtered 
supernatant was either frozen at -80°C or run directly 

25 onto a C8 column for desalting. 

The pellets from the precipitation were washed with 
a solution of 90 mM citrate pH 3.0-3.1. Between 150-200 
ml of citrate solution was added to each pellet of 1-2 
grams of hCA. The pellets were homogenized and the 

30 supernatant from the wash was filtered through a 0.45 jim 
filter and saved. The washed pellets were discarded. 
The supernatant was either frozen at -80°C or desalted 
directly on a C8 column. 



35 



C8 Desalting of Ldr-GRF (1-44) -Ala 

The 90 mM citric acid solution of Ldr-GRF (1-44) -Ala 
was loaded directly onto a preparative C8 column. The 
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column was eluted with a gradient of aqueous 
ethanol/acetic acid buffers. The fractions containing 
the Ldr-GRF (1-44) -Ala were collected (typically 90-95% 
pure) and the solution concentrated by evaporation to 
5 21-25 mg/ml. The resulting solution was stored at -80°C 
prior to transpeptidation. 

Transpeptidation of Ldr-GRF (1-44) -Ala 

The concentration Ldr-GRF (1-44) -Ala peptide was 

10 determined by C8 HPLC using 1 mM GRF standard. The 

peptide solution was diluted to a concentration of 3 mM 
Ldr-GRF (1-44) -Ala (18.0 mg/ml) with water. EDTA and 
sodium phosphate were added (to 5 mM; 1.9 mg/ml and 25 
mM; 3.6 mg/ml respectively) and the pH was adjusted to 

15 6.0 with 1 M NaOH. 2-Nitrophenylglycinamide [ONPGA] 
(250 mM; 89.0 m g/ml) was added and the pH was again 
adjusted to 6 . 0 with 1 M NaOH. Carboxypeptidase-Y 
("CPD-Y" ; 2 /il/ml) was the added and stirring was in the 
dark at 35-40 °C.' The extent of reaction to produce 

20 Ldr - GRF - ONPGA was monitored by HPLC. After about 1-2 
hours the reaction was stopped by the addition of 
acetonitrile to a final concentration of 15% v/v. 

The sample from the CPD-Y transpeptidation was 
.loaded onto a C8 HPLC column and rinsed with a 20% 

25 ethanol, 10 mM sodium phosphate, pH 6 . 8 buffer to remove 
unreacted ONPGA. The column was then eluted with a 5 0% 
EtOH, 10 mM Sodium Phosphate, pH 6 . 8 buffer and the Ldr- 
GRF -ONPGA peak was collected. 

3 0 Photolysis of Ldr - GRF - ONPGA 

The concentration of the Ldr - GRF - ONPGA peptide was 
determined by C8 HPLC using 1 mM GRF standard and the . 
concentration was diluted to 1 mM Ldr - GRF - ONPGA (6.0 
mg/ml) with 50% ethanol. Sodium bisulfite (to 5 mM) and 
3 5 sodium benzoate (to 50 mM ) were added and the solution 
is adjusted to pH 9.5 with 1 M NaOH. After the solution 
has been purged with Argon for 10-15 minutes, the 
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mixture was irradiated (wavelength >3 05 nm; 200-210 W 
medium pressure mercury lamp) while being maintained in 
a 20-25 °C circulating water bath. During the reaction 
a constant flow of Argon was maintained and the reaction 
5 was constantly stirred. The reaction was monitored by 
HPLC. After about 1-2 hours reaction was stopped by 
lowering the pH to 5.5 with acetic acid. 

Enterokinase Cleavage Protocol For Ldr-GRF-NE2 

10 The solution of Ldr-GRF-NK2 from the photolysis 

reaction was diluted 1:5 with water (1.0 mg/ml peptide) . 
Triton X-100 was added to a final concentration of 0.1%. 
Succinic acid and calcium chloride were added to produce 
concentrations of 50 mM (5.9 mg/ml) and 2 mM (0.3 mg/ml) 

15 respectively and the solution pH was adjusted to 5.5 

with 10 M NaOH. After the solution was filtered through 
a 0.4 5 /im membrane, 5.0 mg/ml Dowex 1 resin was added. 
The Dowex 1 resin, an anion exchange resin, was added to 
the reaction to bind the peptide containing the Asp-Asp- 

20 Asp- Asp (SEQ ID NO: 32) sequence. This was found to 

increase both the rate of the reaction and the overall 
yield, as well as reduce the concentration of 
enterokinase required while improving substrate 
"specificity. A 1:3000 ratio of enterokinase enzyme (1 

25 ul per 10 mg peptide) was added and the reaction was 
maintained in a 3 5-40 °C water bath with constant 
stirring. After 20-24 hours, the cleavage reaction 
which converts the Ldr-GRF-NH2 into GRF (1-44 ) -NH2 (SEQ 
ID NO:20) reached 70-80% completion. The reaction 

3 0 mixture was filtered to remove the Dowex 1 and the 

reaction was stopped by the addition of acetonitrile to 
a final concentration of 15%. The sample was stored at 
-80°C until ready for purification. If desired, 
purification of the GRF (1-44) -NH2 (SEQ ID NO:20) product 

3 5 may be carried out by preparative HPLC using a C8 
column. 
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FYam ple 1Q. Production of Peptide via Expression of 
a Soluble hCA Fusion Protein Construct 

E.coli cells transformed with a vector containing 
DNA coding for a hCA- 9 AA fusion protein may be prepared 
5 and cultured according to the procedures described 

above. The hCA- 9AA fusion protein includes hCAII linked 
to the amino acid sequence Thr-Asn-Thr-Gly-Ser-Gly-Thr- 
Pro-Ala ("9AA") (SEQ ID NO: 33) through an 
interconnecting peptide. The interconnecting peptide 
10 includes an enterokinase cleavage site positioned 

immediate adjacent the N-terminus of the 9AA (SEQ ID 
NO: 33) sequence. The hCA- 9AA fusion protein may be 
expressed as a soluble protein in E. coli BL21 F* ompT r B m" 
B {DE3) cells. 

15 The cell paste from the fermentor is diluted with 

cold wash buffer and chilled to 5-10°C. The chilled 
cell suspension is homogenized at 12,000 psi with Galin 
high pressure homogenizer. The. cell paste is passed 
through a heat exchanger and chilled to 10 °C prior to a 

2 0 second pass through the homogenizer. A PMSF stock 
solution in 95% ethanol is added to a final 
concentration of 0.05 mM PMSF. The lysate is cooled to 
about 8°C, passed through the homogenizer at 12,000 psi 
and drained into a holding tank. After passing the 

25 lysate through the homogenizer at 12,000 psi yet another 
time, a 5% v/v pH 7.5 solution of polyethyleneimine 
(PEI) may be added to a total concentration of 0.35% 
PEI. The resulting mixture is stirred for 20 minutes 
and clarified by Westfalia at 20,000 x g. The clarified 

30 solution is filtered through a 0.22 micro iim Pall filter 
system. 

A affinity column including p-aminomethylbenzene- 
sulfonamide- agarose resin is equilibrated with a 0.1M 
Tris-S0 4 pH 8.7 solution. The clarified supernatant from 
35 the PEI precipitation is loaded onto the column and the 
column is eluted with a series of Tris-S0 4 buffer 
solutions. The column fractions containing the fusion 
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protein construct are pooled. The protein construct is 
precipitated from the pooled fractions by lowering the 
pH of the solution to 4.0 with glacial acetic acid. The 
precipitated fusion protein construct may be isolated by 
5 centrifugation and lyophilized. 

The precipitated fusion protein construct is 
dissolved in a ph 8.0 buffer solution containing 50rnM 
Tris and ImM CaCl 2 . The mixture is sonicated until all 
the fusion protein construct has been dissolved. After 

10 the solution was filtered through a 0.45 urn membrane, 
5.0 mg/ml Dowex 1 resin was added. A 1:3000 ratio of 
enterokinase enzyme (1 ul per 10 mg peptide) was added 
and the reaction was maintained in a 35-40 °C water bath 
with constant stirring for 20-24 hours. .The reaction 

15 mixture was filtered to remove the Dowex 1 and the 

reaction was stopped by the addition of acetonitrile to 
a final concentration of 15%. A 3:1 volume of 15% (v/v) 
acetic acid is added to the solution and the . resulting 
mixture is stirred for 30 minutes, thereby precipitating 

20 the peptide fragment which includes hCA. The hCA 
fragment is removed by centrifugation yielding a 
supernatant which contains the 9AA (SEQ ID NO: 33) 
peptide. The supernatant may be desalted on a C-8 
column and stored at -80°C. 

25 The invention has been described with reference to 

various specific and preferred embodiments and 
techniques. However, It will be apparent to one of 
ordinary skill in the art that many variations and 
modifications may be made while remaining within the 

30 spirit and scope of the invention. 

All publications and patent applications in this 
specification are indicative of the level of ordinary 
skill in the art to which this invention pertains. All 
publications and patent applications are herein 

35 incorporated by reference to the same extent as if each 
individual publication or patent application was 
specifically and individually indicated by reference. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: BioNebraska, Inc. 

(ii) TITLE OF THE INVENTION: PRODUCTION OF PEPTIDES USING 
RECOMBINANT FUSION PROTEIN CONSTRICTS 

{iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merchant & Gould 

(B) STREET: 3100 Norwest Center, 90 S. 7th Street 

(C) CITY: Minneapolis 

(D) STATE: MN 

(E) COUNTRY: U.S.A. 

(F) ZIP: 55402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 07-DEC-1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/350,530 

(B) FILING DATE: 07-DEC-1994 



(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Carter, Charles G 

(B) REGISTRATION NUMBER: 35,093 

(C) ^REFERENCE /DOCKET NUMBER: 8648.45USWO 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 612/332-5300 

(B) TELEFAX: 612/332-9081 

(C) TELEX: 



(2) INFORMATION FOR SEQ* ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .159 
(D) OTHER INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAC GAC GAC GAT AAA 
Asp Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(y) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Asp Asp Asp Asp Lys 
15 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1. . .12 
(D) OTHER INFORMATION: 

(A) NAME /KEY: matjpeptide 

(B) LOCATION: 1. . .0 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATT GAA GGA AGA 
He Glu Gly Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
lie Glu Gly Arg 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .24 
(D) OTHER INFORMATION: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1...0 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CAT CCT TTT CAT CTG CTG GTT TAT 
His Pro Phe His Leu Leu Val Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO : € : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

His Pro Phe His Leu Leu Val Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
• AGCTTTCGTT GACGACGACG ATATCTT 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
AGCTAAGATA TCGTCGTCGT CAACGAA 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
AG CTG AATTC AACGTTCTCG AGGAT 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ATCCTCGAGA ACGTTGAATT C 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOtll: 
CCCAAGCTTC TGTTCGTGGT CCGCGTTCTG TTTCTGAAA 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
GAAACAGAAC GCGGACCACG AACAGAAGCT TGGG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TCCAGCTGAT GCACAACCTG GGTAAACACC TGAACT 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

AGGTGTTTAC CCAGGTTGTG CATCAGCTGG ATTTCA 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 37 base pairs 
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(B) TYPE: nucleic acid 
{C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
CTATGGAACG TGTTGAATGG CTGCGTAAAA AACTGCA 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ' ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
TTTTTTACGC AGCCATTCAA CACGTTCCAT AGAGTTC 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGACGTTCAC AACTTCTAAG ATATCCGG 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



WO 96/17942 



PCT/US95/15800 



47 



CCGGATATCT TAGAAGTTGT GAACGTCCTG CAG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ala Met Val Asp Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Tyr Ala Asp Ala lie Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gin 

15 10 15 

Leu Ser Ala Arg Lys Leu Leu Gin Asp He Met Ser Arg Gin Gin Gly 

20 25 30 

Glu Ser Asn Gin Glu Arg Gly Ala Arg Ala Arg Leu 
35 40 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

Kis Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly 

15 10 15 

Gin Ala Ala Lys Glu Phe He Ala Trp Leu Val Lys 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPEt internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



Ser Val Ser Glu He Gin Leu Met His Asn Leu Gly Lys His Leu Asn 

15 10 15 

Ser Met Glu Arg Val Glu Trp Leu Arg Lys Lys Leu Gin Asp Val His 
20 25 30 

Asn Phe 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Tyr Ala Asp Ala He Phe' Thr Asn Ser Tyr Arg Lys Val Leu Gly Gin 

1 5 10 15 

Leu Ser Ala Arg Lys Leu Leu Gin Asp He Met Ser Arg Gin Gin Gly 

20 25 30 

Glu Ser Asn Gin Glu Arg Gly Ala Arg 
35 " 40 



(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val 

15 10 15 

Ser Ser *Tyr Leu Glu Gly Gin Ala Ala Lys Glu Phe He Ala Trp Leu 

20 25 30 

Val Lys Gly Arg 
35 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Gin Leu Met His Asn Leu Gly Lys His Leu Asn 

10 15 
Glu Trp Leu Arg Lys Lys Leu Gin Asp Val His 

25 30 
Gly Ala Pro Leu Ala Pro Arg Asp Ala Gly Ser 

40 45 
Lys Glu Asp Asn Val Leu Val Glu Ser His Glu 
. 55 ~ 60 

Ala Asp Lys Ala Asp Val Asn Val Leu Thr Lys 
7 0 " " 75 80 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val 

15 10 15 

Ser Ser Tyr Leu Glu Gly Gin Ala Ala Lys Glu Phe He Ala Trp Leu 
20 25 30 

Val Lys 



Ser 


Val 


Ser 


Glu He 


1 






5 


Ser 


Met 


GlU 


Arg Val 








20 


A.sn 


Phe 


Val 


Ala Leu 






35 




Gin Arg 


Pro 


Arg Lys 
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Lys 


Ser 


Leu 


Gly Glu 


65 








Ala 


Lys 


Ser 


Gin 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
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His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly 

1 5 10 15 

Gin Ala Ala Lys Glu Phe He Ala Trp Leu Val Lys Gly Arg 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Kis Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly 

15 10 15 

Gin Ala Ala Lys Glu Phe He Ala Trp Leu Val Lys Gly Arg Cys 
20 25 30 



.(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Ser Val Ser Glu He Gly Leu Met His Asn Leu Gly Lys, His Leu Asn 

1 5 10 15 

Ser Met Glu Arg Val Glu Trp Leu Arg Lys Lys Leu Gin Asp Val His 

20 25 30 

Asn Phe Val Ala Leu Gly 
35 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly 
15 10 15 
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Gin Ala Ala Lys Glu Phe He Ala Trp Leu Val Lys Ala Phe Ala 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 31: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Tyr Ala Asp Ala He Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gin 

1 5 10 15 

Leu Ser Ala Arg Lys Leu Leu Gin Asp He Met Ser Arg Gin Gin Gly 

20 ' 25 30 

Glu Ser Asn Gin Glu Arg Gly Ala Arg Ala Arg Leu Ala 

35 40 45 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



Asp Asp Asp Asp 
1 



(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 



Thr Asn Thr Gly Ser Gly Thr Pro Ala 
1 5 
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WHAT IS CLAIMED IS: 

1. A method of producing a peptide comprising: 

precipitating a fusion protein construct, 
5 which includes a carbonic anhydrase and a variable 

fused polypeptide. 

2. The method of claim 1 comprising precipitating the 
fusion protein construct from a solution at a pH of 

10 about 3.2 to about 6.0. 

3. The method of claim 2 wherein precipitating the 
fusion protein construct further comprises adding a 
salt to the solution. 

15 

4. The method of claim 3 wherein the salt includes a 
divalent metal cation. 

5. The method of claim 2 wherein precipitating the 
20 fusion protein construct comprises adding an acid 

to the solution. 

6. The method of claim 2 wherein precipitating the 
fusion protein construct comprises adding an base 

25 to the solution. 

7. The method of claim 1 comprising precipitating the 
fusion protein construct from a solution which 
includes ammonium sulfate or sodium sulfate. 



30 



8. The method of claim 1 comprising precipitating the 
fusion protein construct from a solution which 
includes a chaotropic agent. 



35 9. 



The method of claim 8 wherein the chaotropic agent 
comprises guanidine hydrochloride. 
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10. The method of claim 1 comprising precipitating the 
fusion protein construct from a solution which 
includes urea. 

5 11. The method of claim 1 wherein isolating the fusion 
protein construct includes resolubilizing the 
precipitated fusion protein construct. 

12. The method of claim 11 comprising resolubilizing 
10 the precipitated fusion protein construct in a 

solution having a pH of at least about 10.. 

13. The method of claim 11 comprising resolubilizing 
the precipitated fusion protein construct in a 

15 solution having a pH of no more than about 3.2. 

14. The method of claim 11 comprising resolubilizing 
the precipitated fusion protein construct in a 
solution which includes a chaotropic agent or a 

20 detergent. 

15. The method of claim 1 further comprising cleaving 
the fusion protein construct to produce a soluble 
carbonic anhydrase fragment and a soluble variable 

25 fused polypeptide fragment. 

16. The method of claim 15 further comprising 
precipitating the carbonic anhydrase fragment. 

30 17. The method of claim 15 further comprising 

contacting a solution of the carbonic anhydrase 
fragment with a support which includes a 
benzenesulfonamide compound or a acetazolamide 
compound . 

35 

18. The method of claim 1 wherein the variable fused 
polypeptide comprises an amino acid sequence 
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corresponding to a peptide selected from the group 
consisting of GRP{1-41) (SEQ ID NO:23) , GLPK7-34) 
(SEQ ID N0:21>, and PTH(l-34) (SEQ ID NO:22). 

The method of claim 1 further comprising expressing 
the fusion protein construct as part of an 
inclusion body in a host cell. 

The method of claim 19 further comprising 
dissolving the inclusion body to produce a 
solubilized fusion protein construct. 

The method of claim 2 0 comprising dissolving the 
inclusion body in a solution which includes citric 
acid. 

The method of claim 20 comprising dissolving the 
inclusion body in a solution which includes a 
chaotropic agent or a detergent. 

The method of claim 20 comprising dissolving the 
inclusion body in a solution having a pH of at 
least about 10 . 

The method of claim 1 wherein the variable fused 
polypeptide includes at least 2 copies of a target 
peptide . 

The method of claim 24 wherein the variable fused 
polypeptide includes an amino acid sequence having 
the formula: 

-TargP- (CS2) - [ - (Lnl) n - (CS1) m -TargP- (CS2 ) - ] r 
wherein the -CS1- and -CS2- are cleavage 
sites, the -(Lnl)- is a linking sequence, the 
-TargP- is the target peptide, n and m are 0 or 1, 
and r is an integer from 1 to about 150. 
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The method of claim 25 wherein the target peptide 
includes an amino acid sequence corresponding to a 
peptide selected from the group consisting of 
GRFtl-41) (SEQ ID NO:23), GLPK7-34) (SEQ ID N0:21) 
and PTH(l-34) (SEQ ID NO:22). 

The method of claim 1 wherein the carbonic 
anhydrase is linked to the variable fused 
polypeptide by a cleavage site. 

The method of claim 27 wherein the cleavage site is 
a chemical cleavage site. 

The method of claim 27 wherein the cleavage site is 
an enzymatic cleavage site recognized by an 
en'dopeptidase . 

A method of producing a peptide comprising: 

cleaving a fusion protein construct, which 
includes a carbonic anhydrase and a variable fused 
polypeptide, to produce a soluble carbonic 
anhydrase fragment and a soluble variable fused 
polypeptide fragment; and 

precipitating the carbonic anhydrase fragment. 

The method of claim 30 further comprising 
extracting the precipitated carbonic anhydrase 
fragment with a solvent to recover an additional 
amount of the variable fused polypeptide fragment, 
wherein the solvent includes an organic component. 

The method of claim 31 wherein the organic solvent 
solvent comprises acetonitrile , propanol, citric 
acid, polyethyleneglycol , or mixtures thereof. 

The method of claim 30 further comprising isolating 
the fusion protein construct. 
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The method of claim 33 wherein isolating the fusion 
protein construct includes precipitating the fusion 
protein construct. 

The method of claim 30 comprising precipitating the 
soluble carbonic anhydrase fragment from a solution 
at a pH of about 3.2 to about 6.0. 

The method of claim 35 wherein precipitating the 
soluble carbonic anhydrase fragment further 
comprises adding a salt to the solution. 

The method of claim 36 wherein the salt includes a 
divalent metal cation. 

The method of claim 33 wherein isolating the fusion 
protein construct includes contacting a solution of 
the fusion protein construct with a support which 
includes a benzenesulf onamide compound or a 
acetazolamide compound. 

The method of claim 38 wherein contacting the 
fusion protein construct solution with the support 
includes passing the fusion protein construct 
solution through an affinity column which includes 
a sulfanilamide compound. 

The method of claim 3 3 further comprising 
expressing the fusion protein construct as part of 
an inclusion body in a host cell. 

The method of claim 33 wherein isolating the fusion 
protein construct comprises adding PEI to a 
solution of the fusion protein construct. 

The method of claim 41 further comprising adjusting 
the pH of the PEI solution to between about 3.2 and 
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about 6.0, thereby precipitating the fusion protein 
construct . 

43. The method of claim 30 wherein the carbonic 
5 anhydrase is linked to the variable fused 

polypeptide through an enzymatic cleavage site; and 
wherein cleaving the fusion protein 
construct includes treating the fusion protein 
construct with an endopeptidase selected from 
10 the group consisting of enterokinase , Factor 

Xa, ubiquitin cleaving enzyme, thrombin, 
trypsin, renin, subtilisin, chymotrypsin, 
clostripain, and S. aureus V8 . 

15 44. The method of claim 43 wherein cleaving the fusion 
protein construct comprises treating the fusion 
protein construct, which is dissolved in a solution 
which includes sodium chloride, with thrombin. 

20 45. The method of claim 43 wherein cleaving the fusion 
protein construct comprises adding a anion exchange 
resin to a solution of the fusion protein construct 
and treating the fusion protein construct with 
enterokinase . 

25 

46. The method of claim 30 wherein the carbonic 
anhydrase is linked to the variable fused 
polypeptide through a chemical cleavage site; and 
wherein cleaving the fusion protein construct 
3 0 includes treating the fusion protein construct with 

a chemical cleavage agent selected from the group 
consisting of cyanogen bromide, hydroxylamine , 
BNPS-skatole, an S-cyanylating agent and an acid 
having a pK a of no more than about 3.0. 



35 



47. An inclusion body expressed in an E. coli host cell 
comprising a fusion protein construct, wherein the 
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fusion protein construct includes a carbonic 
anhydrase and a target peptide, said target peptide 
including an amino acid sequence corresponding to a 
peptide selected from the group consisting of 
5 GRF(1-41) (SEQ ID NO:23), GLPK7-34) (SEQ ID N0:21) 

and PTH(l-34) (SEQ ID NO:22) . 

48. The inclusion body of claim 47 wherein the fusion 
protein construct includes at least 2 copies of a 

10 target peptide. 

49. The inclusion body of claim 47 wherein the target 
peptide includes an amino acid sequence 
corresponding to PTH(l-34) (SEQ ID NO:22); and 

15 wherein the carbonic anhydrase is linked to the 

target peptide through an interconnecting peptide 
which includes an enzymatic cleavage site 
recognized by thrombin. 

20 50. The inclusion body of claim 47 wherein the target 
peptide includes an amino acid sequence 
corresponding to GRF(1-41) (SEQ ID NO:23) ; and 
wherein the fusion protein construct further 
comprises an enzymatic cleavage site recognized by 

25 an enzyme selected from the group consisting of 

thrombin, enterokinase and carboxypeptidase Y. 

51. The inclusion body of claim 48 wherein the target 
peptide includes an amino acid sequence 

30 corresponding to GRF(l-44) (SEQ ID NO:20). 

52 . The inclusion body of claim 47 wherein the target 
peptide includes an amino acid sequence 
corresponding to GLPK7-34) (SEQ ID N0:21); and 

35 wherein the fusion protein construct further 

comprises an enzymatic cleavage site recognized by 
trypsin. 
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53 . The inclusion body of claim 47 wherein the target 
peptide includes an amino acid sequence 
corresponding to GLPK7-34) (SEQ ID NO: 21); and 
wherein the fusion protein construct further 
5 comprises a methionine residue. 



54. A method of producing a peptide comprising: 

expressing a fusion protein construct as a 
part of an inclusion body in an E. coli host cell, 
10 wherein the fusion protein construct includes a 

carbonic anhydrase and a variable fused 
polypeptide ; and 

isolating the fusion protein construct. 

15 55. The method of claim 54 wherein the variable fused 
polypeptide comprises a target peptide which 
includes an amino acid sequence corresponding to a 
peptide selected from the group consisting of 
GRF(1-41> (SEQ ID NO:23), GLPl(7-34) (SEQ ID NO:21) 

20 and PTH(l-34) (SEQ ID NO: 22) . 

56. The method of claim 54 wherein the variable fused 
polypeptide includes at least 2 copies of a target 
peptide . 

25 

57. A fusion protein construct comprising a carbonic 
anhydrase and a variable fused polypeptide, wherein 
the variable fused polypeptide includes a target 
peptide, the target peptide including an amino acid 

30 sequence corresponding to a peptide selected from 

the group consisting of GRFll-41) (SEQ. ID NO:23), 
GLPK7-34) (SEQ ID NO:21) and PTHU-34) (SEQ ID 
NO:22) . 



35 58. 



The fusion protein construct of claim 57 wherein 
the variable fused polypeptide includes a target 
peptide corresponding to a peptide selected from 
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the group consisting of GRF(1-41) (SEQ ID 1X0:23), 
GRFCL-44) {SEQ ID NO:20), GLPKl-34) (SEQ ID 
NO:26), GLPK7-34) (SEQ ID NO: 21) , GLPl(7-36) (SEQ 
ID NO:27), GLP1 (7-37) (SEQ ID 1*0:28), PTH(l-34) 
5 (SEQ ID NO:22), PTH(l-38) (SEQ ID NO:29) and PTH(1- 

84) (SEQ ID NO:25) . 



10 
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59. A recombinant gene containing a DNA sequence coding 
for the fusion protein construct of claim 57. 



60. An expression cassette comprising: 

a nucleic acid sequence coding for the fusion 
protein construct of claim' 57; and 

wherein the nucleic acid sequence is operably 
15 linked to a promoter functional in a vector. 

61. An expression vector comprising the expression 
cassette of claim 60. 

20 62. A transformed cell comprising a recombinant gene 
including a DNA sequence coding for the fusion 
protein construct of claim 57. 

63. A method of producing a peptide comprising: 
25 cleaving a fusion protein construct, which 

includes a carbonic anhydrase and a variable fused 
polypeptide, to produce a carbonic anhydrase 
fragment and a variable fused polypeptide fragment; 
precipitating the carbonic anhydrase fragment 
30 and the variable fused polypeptide fragment; and 

extracting the precipitated fragments with a 
solvent which includes an organic component, 
thereby recovering the variable fused polypeptide 
fragment. 



64. The method of claim 63 wherein the organic solvent 
solvent comprises acetonitrile, propanol, citric 
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acid, polyethyleneglycol , or mixtures thereof. 

65. The method of claim 63 wherein the variable fused 
polypeptide includes at least two copies of a 
5 target peptide. 
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